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DSTTRODUCnON 

The  focus  of  this  research  was  the  development  of  a  neural  network  based 
condition  monitoring  system.  The  purpose  of  such  a  system  is  to  prolong  machinery  life 
by  ensuring  that  it  operates  within  safe  parameters.  The  goal  is  to  allow  preventive 
maintenance  to  be  performed  in  such  a  manner  as  to  prevent  catastrophic  failures  and  to 
limit  down  time. 

Currently,  there  are  two  categories  of  maintenance  that  are  performed  on 
equipment  The  flrst  form  is  corrective  maintenance.  Corrective  maintenance  is 
performed  when  a  machine  is  operated  to  the  point  of  breakdown,  at  which  time 
technicians  "correct"  the  problem  and  restart  the  machine.  The  disadvantages  of  this 
approach  are  that  there  is  often  a  lot  of  down  time  and  the  machine  may  suffer  a 
substantial  amount  of  damage,  making  repair  costly. 

The  second  form  of  maintenance  is  preventive  maintenance.  Preventive 
maintenance  is  an  integral  part  of  the  military’s  daily  routine.  In  the  military,  corrective 
maintenance  is  an  unacceptable  alternative.  Equipment  that  does  not  function  risks  lives 
and  reduces  the  effectiveness  of  a  unit’s  assets. 

However,  current  methods  of  preventive  maintenance  are  based  on  periodic 
schedules.  These  schedules  were  established  based  on  empirical  data  that  indicate  when 
a  machine  needs  to  be  serviced.  They  do  not  reflect  actual  knowledge  of  any  particular 
machine’s  operating  conditions.  The  result  is  that  machines  may  be  serviced  when  they 
do  not  need  to  be  and  may  not  get  serviced  when  they  do.  Neither  case  is  economically 


efficient 


5 


The  ideal  situation  would  be  to  incorporate  a  condition  monitoring  system  which 
allows  preventive  maintenance  to  be  scheduled  more  efficiently.  Current  methods  of 
condition  monitoring  are  based  on  analysis  by  skilled  technicians.  The  technicians 
examine  data  which  reflect  the  machine’s  operating  parameters  and  then  make  a 
judgement  on  whether  or  not  the  machine  is  operating  properly. 

This  approach  has  several  limitations.  First,  the  experience  of  the  technician  has 
a  direct  relationship  on  his/her  ability  to  distinguish  parameters  which  might  lead  to  a 
fault  condition.  Not  all  technicians  have  the  same  level  of  expertise.  Second,  the  training 
and  use  of  these  technicians  is  expensive.  Finally,  consistent  performance  is  difficult  to 
guarantee.  People  perform  di^erently  under  different  situations.  A  technician  who  has 
been  on  watch  for  six  hours  may  not  recognize  something  that  he  would  have  recognized 
had  he  just  posted  the  watch. 

The  use  of  artificial  neural  networks  may  provide  an  answer  to  some  of  these 
problems.  An  artificial  neural  network  creates  a  vector  mapping  between  two  sets  of 
data.  In  this  case,  the  input  is  a  vector  of  machine  descriptors  and  the  output  is  the 
machine’s  current  condition.  The  advantage  of  artificial  neural  networks  is  that  they  do 
not  require  a  technician,  skilled  in  data  processing,  to  create  the  mapping.  This  reduces 
cost  by  taking  people  out  of  the  loop  and  it  offers  a  much  more  consistent  level  of 
performance  since  it  is  not  subject  to  human  factors  such  as  boredom,  fatigue,  and 
incompetence. 


HISTORY  OF  NEURAL  NETWORKS 


The  development  of  artificial  neural  networks  was  the  result  of  man’s  attempt  to 
simulate  the  operation  of  the  human  brain.  The  brain  consists  of  biUions  of  densely 
packed,  interconnected  neurons.  The  connections  between  neurons  are  called  synapses. 
The  transmission  of  information  across  these  synapses  is  electro-chemical  in  nature,  with 
the  strength  of  the  signal  depending  on  the  amount  of  neuro-transmitter  released.  These 
synapses,  and  the  infonnation  that  they  transfer,  form  the  basis  for  the  operation  of  the 
human  brain. 

The  construction  of  an  artificial  neural  network  is  very  similar  to  the  model  of  the 
brain.  It  consists  a  large  number  of  processing  elements  that  are  connected  in  a  highly 
parallel  manner.  The  strengths  of  the  various  connections  are  determined  by  variable 
weighting  factors.  The  power  of  neural  networks  lies  not  in  the  complexity  of  individual 
neurons  but,  rather,  in  the  high  degree  of  parallelism  which  exists. 

Research  on  neural  networks  has  been  an  ongoing  process  for  the  last  forty  five 
years.  Interest  in  this  field  was  sparked  in  the  late  1940’s  when  McCulloch  and  Pitts 
published  a  paper  entitled,  "A  Logical  Calculus  of  Ideas  Imminent  in  Nervous 
Activity".  (Ref  #1]  Further  research,  in  particular  the  efforts  of  Hebb  and 
Rosenblatt,  led  to  the  development  of  what  became  known  as  the  Hebbian 
rule  [Ref  #2]  and  the  perceptron  [Ref  #3].  Later,  Widrow  and  Hoff 
developed  a  neural  network  based  on  the  ADALINE  (ADAptive  LINear  Element) 
perceptron  [Ref  #4].  They  correctly  theorized  that  it  would  be  possible  for  their 
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network  to  distinguish  between  two  linearly-separable  regions.  Although  this  theory  was 
valid,  their  work  was  criticized  harshly  in  the  book,  Perceotrons.  written  by  Marvin 
Minski  and  Seymour  Pappert  [Ref  #5].  After  extensive  mathematical  study, 
Minsld  and  Pappert  concluded  that  a  neural  network  based  on  the  ADALINE  perception 
was  unable  to  solve  non-linear  problems  such  as  the  exclusive-OR  problem.  They  also 
found  that  the  network  did  not  have  the  ability  to  adjust  the  weights  of  hidden  layers. 
This  severely  limited  the  amount  of  learning  that  the  neural  network  could  achieve. 
Although  some  research  did  continue  in  the  field,  this  book’s  conclusion  that  neural 
networks  did  not  warrant  further  study,  caused  both  interest  and  funding  for  smdy  to  be 
drastically  reduced. 

In  the  early  1980’s,  neural  networks  again  began  to  attract  the  attention  of 
scientists.  In  1986,  Rummelhart  refined  a  learning  process  that  had  become  known  as  the 
generalized  delta  rule  [Ref  #6],  which  was  a  means  of  adjusting  the  weights 
within  a  neural  network.  This  allowed  learning  algorithms  to  adjust  the  hidden  layer 
weights.  Since  his  findings,  interest  in  neural  networks  bas  steadily  increased.  Practical 
applications  are  continually  being  discovered.  Currently,  neural  networks  are  being  used 
for  such  applications  as  stock  market  forecasting,  signal  processing,  check  reading  and 
verification.  AT&T  uses  a  neural  network  chip  to  reduce  the  noise  on  phone  lines. 

The  advantage  that  neural  networks  offer  is  that  an  operator  does  not  have  to  have 
an  a  priori  knowledge  of  the  equations  which  relate  the  input  data  to  the  output  data. 
This  provides  a  powerful  tool  for  analyzing  data  for  slight  discrepancies  that  are  not 
immediately  obvious. 
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WHAT  IS  A  NEURAL  NETWORK? 

Although  artificial  neural  networks  are  based  on  a  model  of  the  human  brain,  they 
do  not  learn  in  the  same  manner  that  humans  learn.  The  operation  of  neural  networks  is 
statistical  in  nature.  This  was  proven  in  1989  when  University  of  California  San  Diego 
economist  Halbert  White  reduced  the  backpropagation  procedure  to  a  stochastic 
approximation.  [Ref  #7] 


Figure  1  Artificial  neuron  or  processing  element 


The  basic  element  of  the  artificial  neural  network  is  the  artificial  neuron  or 


processing  element  (Fig  1).  Each  processing  element  receives  input  from  many  other 
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neurons.  Each  input  is  multiplied  by  a  synaptic  weight  and  these  weighted  inputs  are  then 
summed  and  passed  throu/r'.  an  activation  function  to  form  the  neuron’s  output  This 
process  is  repeated  as  the  output  of  the  neuron  is  fed  into  other  neurons. 

The  activation  function  diat  is  used  within  the  individual  neuron  is  a  continuous 
function.  Typical  functions  that  are  used  for  this  purpose  are  the  sigmoid  function,  the 
sine  function,  and  the  hyperbolic  tangent  function.  These  functions  are  used  because  of 
the  ability  to  express  their  derivatives  in  terms  of  the  oiiginal  function  facilitates 
programming.  The  equation  for  the  sigmoid  function  is 

The  typical  artificial  neural  network  has  its  neurons  grouped  into  layers.  The  input 
layer  performs  no  processing  but  merely  buffers  the  input  data  and  passes  it  on  to 
succeeding  layers.  The  final  layer  is  where  network  output  is  read.  Between  these  two 
layers  may  be  any  number  of  hidden  layers.  The  purpose  of  each  hidden  layer  is  to 
extract  information  from  the  previous  layer’s  outputs  for  mapping  to  the  next  layer.  In 
general,  more  hidden  layers  equates  to  more  complex  mappings. 

Two  characteristics  are  needed  to  accurately  describe  a  neural  network.  The  first 
is  its  architecture  and  the  second  is  the  type  of  learning  algorithm  that  the  neural  network 
employs.  It  is  a  common  misconception  to  label  neural  networks  using  only  one  of  these 
specifications.  A  neural  network  caruiot  be  accurately  labeled  by  only  one  of  these 


characteristics. 
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There  are  two  basic  types  of  neural  network  architecnues.  The  first  and  most 
widely  used  is  the  feed-  forward  network.  In  the  feed-forward  network,  the  output  of 
each  processing  element  is  fed  forward  to  a  neuron  on  a  successive  layer  (Fig  2). 


Figure  2  Example  of  a  feed- forward  network 


The  second  type  is  the  recurrent  or  feedback  neural  network.  This  neural  network  makes 
use  of  a  feedback  path  from  the  output  of  one  or  more  processing  elements  to  the  input 
of  the  processing  elements  of  a  previous  layer.  The  advantage  of  a  feedback  network  is 
that,  like  sequential  circuits,  the  network  does  not  always  respond  in  the  same  to  the  same 


set  of  input  data.  It  exhibits  memory  (state).  The  network’s  output  depends  both  on  its 


inputs  and  on  its  current  state. 

Neural  network  learning  rules  are  also  classified  with  one  of  two  ways. 


Figure  3  Example  of  a  feeciback  network 
Supervised  learning  takes  place  when  both  die  output  and  input  data  are  known  (for  some 
training  set)  and  are  presented  to  the  neural  network.  The  neural  network  continues  to 
adjust  its  synaptic  weights  until  the  network  output  converges  to  the  desired  output  At 
this  point,  the  system  has  "learned”  the  training  set  of  data  and  is  ready  for  use. 

The  second  category  of  learning  is  unsupervised  learning.  It  is  most  applicable 
when  the  number  of  groupings  of  the  input  data  is  unknown.  Unsupervised  learning  does 
not  require  that  a  desired  ouqrut  data  be  presented  to  the  neural  network.  It  analyzes  the 
data  and  groups  the  input  patterns  based  on  characteristics  that  it  determines  within  the 
feature  input  space.  The  level  of  sensitivity  of  these  networks  can  be  altered,  which 
determines  the  number  of  categories  into  which  the  data  are  separated. 
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A  key  advantage  of  neural  networks  is  that  all  of  the  information  that  the  network 
uses  to  compute  its  output  is  processed  at  one  time.  The  data  are  processed  in  parallel 
vice  series  which  reduces  the  response  time  of  the  network.  Thus,  neural  networks  can 
be  used  in  real  time  applications  that  require  continuous  monitoring. 
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BACKPROPAGATION 


For  this  research,  a  feed-forward  neural  network  with  backpropagation 
(supervised)  leairdng  was  deemed  appropriate  because  of  its  success  in  the  past  The 
name  backpropagation  was  derived  from  the  manner  in  which  the  changes  in  the  synaptic 
weights  are  made.  This  algorithm  seeks  to  adjust  the  weights  in  such  a  maimer  as  to 
follow  the  steepest  gradient  descent  of  the  RMS  error  function  in  order  to  reach  a  least 
mean  square  error  between  the  actual  and  desired  output  of  the  network. 

The  backpropagation  algorithm  assumes  that  all  processing  elements  are 
responsible  for  die  output  error.  As  a  result,  output  error  is  propagated  backwards  (from 
output  to  input  layer),  in  order  to  assign  blame  to  each  weight  in  proportion  to  its  effect. 
The  backpropagation  routine  operates  in  the  following  manner.  First,  the  weights  of  the 
neural  network  are  initialized  to  small  random  values  near  zero.  If  this  is  not  performed, 
then  the  network  will  not  train  since  it  wUl  be  performing  calculations  with  matrices  of 
zeroes.  Second  the  training  vector  y  is  presented  to  the  neural  network  and  the  output 
vector  o  is  calculated,  where 
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The  weight  matrix  is 
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The  desired  output  of  the  neural  network  is 


(6) 
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The  RMS  error  of  the  systero  is  then  calculated  by  taking  the  square  of  the  difference 
between  the  desired  output  and  actual  output  data. 


where  the  subscripts  p  and  k  refer  to  the  training  pattern  and  the  netwoik  layer, 
respectively.  Next,  the  error  associated  with  each  neuron  is  calculated  and  the  weights 
are  adjusted  by  calculating  the  error,  as  a  function  of  the  weight  in  question.  Training 
continues  until  the  RMS  convergence  criterion  is  met  The  goal  of  this  procedure  is  to 
alter  the  weights  of  the  processing  elements  in  order  to  minimize  the  RMS  error.  Because 
of  the  continuous  nature  of  the  activation  function  (sigmoid),  the  RMS  error  never  reaches 
zero. 

This  process  can  be  visualized  by  examining  a  marble  that  is  placed  on  top  of  a 
large  mountain  range  that  contains  many  local  minima  (valleys).  The  goal  is  to  roll  the 
marble  l  wn  the  mountain  range  so  that  it  meets  two  criteria.  First,  it  is  desirable  for  the 
marble  to  follow  the  steepest  path  in  order  to  allow  it  to  get  to  its  destination  quickly. 
Second,  the  marble  should  come  to  rest  at  the  lowest  point  on  the  mountain  range  in  order 
to  minimize  the  RMS  error,  which  is  symbolized  by  the  marble’s  altitude. 

The  rule  which  governs  the  marble’s  descent  down  the  mountain  range  is  the  delta 
rule.  It  determines  the  size  of  weight  adjustment  within  the  neural  network.  It  is  given 
by: 
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where  the  subscripts  j  and  k  refer  to  the  node  and  layer,  respectively.  The  constant^  is 
the  learning  coefficient,  5^  is  the  error  signal  as  a  function  of  the  weights  and  y  is  the 
input  vector.  At  this  point,  the  delta  rule  can  be  derived.  Each  individual  weight 
adjustment  is  given  by 


BE 


(9) 


where  the  error  is  determined  as  in  Eqn  (2).  For  each  node  in  layer  k,  k=l,2,...K  the 
following  holds: 


J^i 


(10) 


where  the  subscript  j  refers  to  each  node  within  each  layer.  Each  neuron’s  output  is 


(11) 
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The  error  signal  term  8^  provided  by  the  k'‘‘  neuron  is  defined  for  this  layer  as 


BE 


(12) 


Since  the  gradient  component  depends  only  on  the  net„  of  a  single  neuron,  and  since  the 
output  of  the  k*  neuron  is  contributed  to  only  by  weights  j=l,2,.  J  for  a  fixed  K,  the 
chain  rule  allows  us  to  write 


BE  _  BE  S(net^ 

Bwgj  B(neti^ 


Since  the  values  of  yj  for  j=l,  2,  ...  J,  coirespond  to  a  fixed  pattern  at  the  input. 


(13) 
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Combining  Eqn(12)  and  Eqn(15)  yields 


BE  , 


(IS) 


Therefore 


(16) 
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Since  E  is  a  function  of  net^ 


(17) 


and  from  Eqn(12) 


aE. 

do^  dCnet^ 


(18) 


The  second  term  in  the  product  of  eqn  (16)  is  the  derivative  of  the  activation  function 


/(net^A 


aC/ietj) 


(19) 


and 


(20) 


Thus 


(21) 
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which  gives  the  delta  rule 


Aw^=Ti(d'^-04)/i(n«rj)y^ 


(22) 


From  this  result,  the  generalized  delta  rule  can  derived.  This  derivation  will  not  be 
performed,  however  the  equation  governing  the  generalized  delta  rule  is  given  by 


m. 


t-i 


(23) 


where  v  is  the  weight  matrix  of  the  connections  to  the  hidden  layer,  and  is  the  input 
to  the  hidden  layer,  w  denotes  the  synaptic  weights  of  the  layer  following  the  hidden 
layer. 

The  generalized  delta  rule  can  be  modified  so  that  it  includes  a  momentum  term 
which  acts  as  a  low  pass  filter,  since  general  trends  within  the  weight  space  are  reinforced 


Aw^,=-TiVE(0+PA>v(f-l) 


(24) 


while  high  frequency  variations  are  ignored.  This  allows  smaller  learning  coefficients  to 
be  used  which  in  turn  let  the  neural  network  examine  die  weight  space  in  greater  detail. 
In  Eqn  (24)  the  momentum  term  is  given  by  6,  where  t  and  (t-1)  indicate  the  current  and 
most  recent  training  step.  Using  the  marble  analogy,  the  addition  of  the  momentum  term 
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helps  to  prevent  the  marble  from  becoming  stuck  in  a  shallow  valley  rather  than  the  deep 
crevice  for  which  it  is  searching. 

The  advantages  of  using  a  multi-layer,  feed-forward  backpropagation  network  are 
that  it  is  relatively  easy  to  implement  and  has  powerful  classification  abilities.  The  most 
noteworthy  of  these  abilities  is  the  classification  of  non-lineariy  separable  pattern  classes. 
This  allows  neural  networks  to  be  used  to  solve  a  wide  variety  of  problems  such  as  the 
exclusive-or  problem 
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VroRATION  MONITORING 

Vibration  monitoring  "consists  of  acquiring  vibration  signals  and  using  them  as 
information  carriers."  [Ref  #8]  This  process  is  suitable  for  this  project  for  the 
following  reasons:  (1)  vibration  monitoring  provides  a  non- intrusive  means  of  examining 
the  system;  and  (2)  reliable  transducers,  which  can  sense  these  vibrations,  are  readily 
available.  Of  course,  the  most  important  reason  is  that  the  vibration  spectrum  is  strongly 
affected  by  the  machinery  operating  condition. 

Several  analysis  methods  are  used  in  traditional  vibration  monitoring  systems.  The 
most  widely  used  is  frequency  domain  analysis.  Since  we  must  sample  the  vibration  data, 
the  conversion  from  time  domain  to  frequency  domain  is  accomplished  by  taking  a 
discrete  Fourier  Transform  (DFT)  of  the  sequence.  The  DFT  X(k)  of  a  sampled  time 
waveform  x(t)  is  given  by: 

AT 

Computation  time  for  the  FFT  is  of  order  0(n*)  where  n  is  the  number  of  elements  in  the 
sequence.  For  large  sequences,  this  becomes  burdensome.  However,  if  n  is  a  power  of 
2,  a  Fast  Fourier  Transform  (FFT)  may  be  employed.  This  reduces  computation  time  to 
the  order  0(nlog(n)). 

Analysis  in  the  frequency  domain  provides  useful  information  when  the  time 
domain  data  describe  oscillating  phenomena  such  as  those  that  are  encountered  in  rotating 
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machinery.  Many  of  the  faults  that  occur  in  these  types  of  machinery  result  in 
perturbations  that  occur  at  specific  frequencies  (Table  I).  Ideally,  the  neural  netwoiic  will 
be  able  to  identify  these  frequencies  and  relate  them  to  their  respective  fault  conditions. 


Table  I:  Typical  rotating  machinery  faults  and  the  frequencies  that  these  faults  produce. 


Source 

Frequency 

Imbalance 

f,  (frequency  of  shaft  rotation) 

Bent  Shaft 

^2^d  4f, 

Mechanical  Looseness 

2fr 

Faulty  Belt  Drive 

multiple  of  belt  frequency 

1  Misaligned  Race 

{±2{, 

Several  other  methods  are  also  of  interest  when  dealing  with  rotating  machinery. 
One  such  method  is  the  cepstrum.  The  cepstrum  is  defmed  as  the  inverse  fourier 
transform  of  the  logarithm  of  the  power  spectrum.  Cepstrum  analysis  has  been  praised 
because  of  its  ability  to  filter  out  the  effects  that  the  structure  of  the  device  may  have  on 
the  measured  signal.  W.D.  Strunk  found  cepstrum  analysis  to  be  especially  helpful  in 
identifying  bearing  problems  in  pieces  of  rotating  machinery.  [Ref  #9] 

Structural  masking  is  a  phenomenon  that  occurs  when  the  vibration  signal  is 
modiffed  by  the  dynamic  properties  of  the  structure.  In  some  cases,  this  modification 
can  be  severe,  resulting  in  data  that  are  representative  of  the  structure  rather  than  the 
machinery’s  vibration  signal  [Ref  #8].  This  is  desirable  for  some  applications.  If  the 
integrity  of  the  structure  is  a  concern,  then  data  which  are  representative  of  the  structure 
are  useful.  In  this  study  however,  care  should  be  taken  to  ensure  that  the  data  represent 
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the  machinery. 

The  focus  of  this  research  did  not  require  the  use  of  powerful  feature  extraction 
techniques.  The  spectrum  alone  provided  sufficient  information  for  classification. 
However,  vibration  analysis  serves  a  valuable  purpose.  The  ability  to  examine  the 
vibration  data  of  a  piece  of  machinery  allows  humans  to  study  the  condition  of  the 
machine.  The  data  in  Table  I  were  obtained  from  vibration  analysis  of  machine.  By  using 
vibration  analysis,  the  amount  of  information  that  is  presented  to  the  neural  netwoiii  can 
be  reduced  drastically.  This  would  make  the  neural  netwoiic  more  efficient.  Tlie 
disadvantage  is  that  this  vibration  analysis  is  time  consuming.  In  this  project,  an  attempt 
was  made  to  limit  the  amount  of  resources  that  were  devoted  to  vibration  analysis 
methods. 
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DESIGN  OF  THE  PHYSICAL  SYSTEM 


Tlte  physical  system  that  serves  as  the  model  for  our  rotating  machine  is  a  simple 
device  designed  around  a  small  AC  motor.  The  AC  motor  is  a  1/3  horsepower  Dayton 
Wattrimmer,  model  number  3M384.  The  motor,  or  main  shaft  turns  at  1075  rpms.  It  is 
coupled  to  a  secondary  shaft  via  a  timing  belt 


Load  Scrsw 


Acoelerometer 


Figure  4  Diagram  of  the  machine 

All  of  the  testing  was  performed  on  the  secondary  shaft  This  shaft  is  attached 
to  the  framework  of  an  aluminum  housing  that  serves  to  support  the  shaft  (see  Figure  4). 
The  shaft  passes  through  two  sets  of  radial  bearings  which  enable  the  shaft  to  rotate.  In 
addition,  it  is  possible  to  add  a  load  to  the  shaft  by  tightening  a  bolt  that  is  on  top  of  the 


PHYSICAL  SYSTEM 


Bearing  Housing 


Pulley  used  to  connect  the  secondary 
shaft  to  the  motor. 
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aluminum  housing. 

Each  bearing  on  the  secondary  shaft  is  surrounded  by  a  housing.  These  housings 
can  be  filled  with  a  fluid  or  substance  to  accelerate  bearing  wear.  A  drain  is  provided  so 
that  the  current  bearing  lubricant  can  be  drawn  off  by  rinsing  the  bearings  with  a  solvent 

The  vibration  data  that  are  collected  from  this  work  are  gathered  from  an 
accelerometer,  which  was  mounted  radially  above  and  on  the  side  of  the  aluminum 
housing.  The  accelerometer  was  mounted  via  a  10-32  bolt  that  protrudes  from  the 
aluminum  housing.  The  location  of  the  accelerometer  used  for  the  project  can  be  viewed 
in  Figure  4. 

Several  faults  can  be  induced  in  the  system.  Bearing  wear  is  produced  by  the 
addition  of  a  corrosive  element  to  the  bearing  lubricant  Overloading  the  shaft  can  be 
simulated  by  the  aforementioned  technique.  A  fault  can  also  be  induced  by  altering  the 
tension  in  the  timing  belt  Loosening  the  belt  will  cause  it  to  slip,  resulting  in  a  new  fault 
condition.  The  belt  could  also  be  worn  so  that  the  integrity  of  the  interface  between  the 
belt  and  the  gear  is  damaged.  This  could  cause  the  belt  to  slip. 
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DESIGN  OF  THE  MONITORING  SYSTEM 

The  monitoring  system  consisted  of  the  accelerometer  which  was  mounted  on  the 
frame  of  the  machinery.  The  signal  from  the  accelerometer  was  ampliHed  and  then 
passed  through  an  A/D  converter.  A 
software  package  then  computed  an  FFT 
of  this  time  domain  data. 

The  amplifier  that  was  constructed 
was  built  using  a  high  input  impedance 
op-amp,  the  LH0084.  The  schematic  for 
this  amplifier  is  included  in  Appendix  I. 

It  was  chosen  for  its  gain  and  precision. 

The  peak  to  peak  amplitude  of  the  signal  from  the  transducers  is  approximately  150  mV. 
Since  the  accelerometer  was  uncalibrated,  it  is  unknown  how  many  g’s  this  corresponds 
to.  This  is  unimportant,  however,  since  the  interest  is  in  spectral  lines  and  not  absolute 
amplitudes.  A  gain  of  100  (ampliHed  so  that  magnitude  is  between  -7.5  V  and  7.5  V)  was 
used  so  that  the  input  signal  covered  a  large  portion  -lOV  to  +10  V  range  of  the  A/D 
converter.  This  increased  the  resolution  of  the  system. 

Some  have  advocated  the  use  of  a  high  pass  filter  in  conjunction  with  the 
previously  mentioned  components.  Hie  high  pass  Hlter  eliminates  frequencies  below  a 
certain  threshold  frequency.  The  vibrations  that  are  typically  associated  with  bearing 
vibrations  are  located  in  the  higher  frequencies.  It  is  was  decided  to  not  use  a  filter 
because  of  the  neural  network’s  ability  to  extract  the  features. 


Figure  5  Flow  chart  of 
condition  monitoring  system 


The  A/D  converter  used  a  part  of  a  Data  Translation  DT2801a  data  acquisition 
card.  A  program  was  written  to  control  this  device  as  well  as  to  compute  the  FFf  of  the 
time  domain  data  thus  collected.  This  information  is  stored  in  a  file  and  used  to  train  the 
neural  network.  Software  to  perform  these  functions  was  available  in  the  C  programming 
language.  This  program  is  included  in  Appendix  II.  It  further  allows  viewing  of  the  data 
in  either  the  time  domain  or  frequency  domain  (with  or  without  a  Harming  window). 

A  Hewlett  Packard  HP3562A  dynamic  signal  analyzer  and  an  oscilloscope  were 
also  available  for  this  study.  These  devices  were  used  to  examine  various  aspects  of  the 
system  quickly,  so  as  to  select  appropriate  sampling  characteristics  for  the  A/D  converter. 

The  heart  of  the  neural  network  based  monitoring  system  is  the  software  package, 
Neural  Works  Professional  H/Plus.  This  package  was  used  to  construct  and  test  all  of  the 
neural  networks  created  in  this  research.  It  is  highly  versatile  in  that  it  supports  many 
different  learning  algorithms  and  network  architectures. 


RESULTS 


The  results  of  this  research  were  obtained  from  a  series  of  experiments  that  were 
designed  to  fulfill  the  stated  objectives.  These  included  a  study  of  disturbances,  transient 
conditions,  and  various  fault  conditions.  The  belt  faults  mentioned  earlier  in  the  report 
were  not  used  due  to  time  constraints. 

Transients 

The  transient  that  was  examined  in  this  project  was  the  vibrational  signature  of  the 
machine.  This  signature  varies  during  the  period  between  warmup  from  a  cold  start  to 
the  point  at  which  it  reaches  a  steady  state.  Essentially,  this  problem  is  similar  to  the 
problem  that  a  person  has  with  an  old  automobile  on  a  cold  day.  The  machinery  in  this 
experiment,  like  an  automobile,  took  a  while  to  warm  up  and  its  performance  was  erratic 
until  it  did  so.  The  purpose  of  this  portion  of  the  project  was  to  examine  the  spectral  data 
taken  on  the  machine  after  a  cold  start  in  order  to  determine  if  the  network  could  be 
trained  to  treat  this  transient  condition  as  a  normal  operating  condition. 

Data  for  this  first  study  were  collected  in  five  second  segments  at  one  minute 
intervals  for  thirteen  minutes.  In  each  five  second  interval,  thirty  256  point  FFTs  were 
computed.  Only  the  positive  frequency  components  of  die  signal  were  retained  due  to  the 
redundancy  present  The  resulting  128  point  FFT  spectrum  was  presented  to  the  neural 
network  as  input  The  frequency  range  that  was  used  extended  from  0  to  5  KHz.  The 
neural  network  that  was  trained  consisted  of  a  128  node  input  layer,  a  1  node  hidden 


layer,  and  a  1  node  output  layer  (128-1'1).  The  activation  function  used  was  a  sigmoid 
function  and  the  delta  rule  was  used  to  train  the  network.  The  desired  netwoik  output  for 
each  piece  of  training  data  is  listed  in  Table  D.  The  values  for  desired  output  are  values 
that  were  chosen  to  represent  specific  known  conditions  of  the  rotating  machine. 


Table  11:  Training  criteria  used  to  classify  transients 
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U _ 3 
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The  goal  was  to  prove  that  the  neural  netwoik  could  (1)  train  to  recognize 
specific  transient  conditions  and  (2)  make  classifications  on  sets  of  transient  data  that 
it  had  never  seen  before.  The  theory  to  be  tested  in  this  experiment  was  that  the 
changes  in  the  machinery's  frequency  spectrum  vary  smoothly  during  this  startup 
period. 

The  results  of  this  experiment  indicated  that  it  was  possible  for  the  neural 
netwoik  to  make  a  determination  of  the  length  of  time  after  startup  that  a  set  data  was 
taken.  This  can  be  seen  in  Figures  6  and  7.  The  bar  chart  in  Figure  6  shows  the 
response  of  the  neural  netwoik  to  the  set  of  data  on  which  it  was  trained.  This  is  not 
surprising  in  itself  because,  the  network’s  response  to  the  training  data  gain  be 


estimated  by  looking  at 
the  RMS  error.  The 
information  that  is  of 
interest  is  how  the  neural 
networic  responds  to  the 
data  taken  at  times  on 
which  it  was  not  trained. 
Figure  7  shows  this  graph 
and  it  is  from  this  graph 
that  certain  conclusions 
can  be  drawn. 

The  bar  chart  in 
Figure  7  supports  the 
conclusion  that  the 
frequency  spectrum  vary 
smoothly.  It  also  supports 
the  conclusion  that  a 
neural  network  can  be 
trained  to  recognize 
transient  conditions  in 
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E^torminadion  of  Transient  States: 
Network  Response  to  Training  Data 


as 


1  3  S  7  S  11 


I _ Uma  ftnini _ I 

Figure  6  shows  the  network's  response  to 
the  training  vector.  Results  should  be 
sinvilar  to  values  in  Table  II. 

beterrininalion  of  Transient  Stales ; 

Network  Response  to  Test  Data 


as 


2  4  6  8  10  12 


1  nm«  /niin> _ | 

Figure  7  shows  network's  response  to  the 
test  vector.  The  similarity  of  these 
results  to  the  results  in  Figure  6  proves 
the  validity  of  this  method. 


order  to  prevent  it  from  misclassifying  Uiem  as  fault  conditions. 

This  test  was  not  exhaustive,  however.  There  may  be  other  transient  conditions 
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on  which  the  neural  network  was  trained  which  would  look  like  fault  conditions. 
Testing  must  be  undertaken  on  each  transient  to  ensure  that  the  network’s 
classification  is  accurate. 

Loading  Effects 

As  discussed  in  die  construction  details  of  this  paper,  the  physical  system 
possessed  a  means  of  varying  the  radial  loading  placed  on  the  secondary  shaft  This 
was  accomplished  by  tightening  and  loosening  the  load  screw  on  top  of  the  machinery. 
The  purpose  of  this  experiment  was  examine  the  ability  of  the  neural  network  to 
identify  different  loading  conditions  applied  to  the  secondary  shaft  The  data  used  to 
train  the  network  were  again  in  the  form  of  a  128  point  spectrum. 

For  this  experiment  two  sets  of  data  were  taken.  The  first  was  the  control  set 
These  data  were  taken  with  no  loading  placed  on  the  shaft  The  second  set  of  data 
were  taken  after  the  load  screw  had  been  tightened  to  a  hand  tight  position.  These 
data  served  as  an  example  of  the  fault  condition.  The  data  were  taken  in 
approximately  15  second  segments  (around  90  to  a  100  FFTs  were  taken  during  this 
time)  and  were  used  to  train  a  network.  The  network  used  in  this  experiment  was 
again  a  128-1-1  network. 

The  network  was  successfully  trained  on  these  data.  However,  when  the 
network  was  presented  a  set  of  data  upon  which  it  had  not  been  trained,  it  failed  to 
make  an  accurate  classification.  These  data  were  collected  under  identical  conditions 


as  the  training  data,  but  they  were  taken  at  different  times.  The  reason  for  this  failure 
was  that  the  information  that  was  presented  to  the  neural  network  was  not  a 
statistically  valid  set  of  data.  If  die  information  presented  to  a  neural  network  does 
not  contain  the  information  necessary  to  identify  the  various  features  of  the  training 
sets  then  the  neural  network  cannot  accurately  classify  the  pattern.  This  is  often 
referred  to  as  the  "garbage  in,  garbage  out"  theory. 

A  question  that  might  be  a^d  is,  "How  did  the  neural  network’s  RMS  error 
converge  if  die  neural  network  was  not  presented  a  valid  set  of  data  upon  which  to 
train?"  The  answer  to  this  question  is  that  the  neural  network’s  classifications  were 
erroneously  based  on  differences  within  pattern  space  which  resulted  from  disturbance 
noise.  The  descriptors  chosen  to  train  the  network  were  not  sufficient  to  separate  the 
pattern  classes.  The  successful  convergence  of  the  neural  network  during  training 
indicates  that  the  convergence  was  based  on  spurious  patterns  contained  within  die 
descriptors  that  did  not  contain  information  on  the  nature  of  a  fault. 

Another  possible  reason  for  the  failure  to  classify  the  training  data  was  the 
possibility  of  changing  vibration  characteristics  of  the  machine.  As  a  result,  another 
test  was  devised.  This  test  involved  taking  seven  sets  of  data  for  both  of  the  loaded 
and  normal  conditions  of  the  machine  and  training  the  network  on  this  information. 
Again,  the  network  successfully  trained  on  these  data  but  failed  to  accurately  classify 
the  test  data.  This  supported  the  earlier  conclusion  that  the  data  presented  to  the 
network  were  not  statistically  valid. 


Fault  Conditions 


The  most  important  aspect  of  this  project  is  the  neural  network’s  ability  to  detect 
and  classify  fault  conditions.  If  the  network  caimot  accurately  classify  and  detect  these 
faults,  then  the  ability  to  handle  transients  and  disturbances  is  wasted.  The  vibrations  that 
are  being  studied  ate  the  bearing  vibrations  of  the  piece  of  machinery  under  three 
conditions  of  bearing  wear.  The  conditions  are  listed  in  the  following  table  along  with 
their  appropriate  desired  ouqiut  data. 


Table  ID:  Desired  response  of  the  output  nodes  to  various  fault  conditions. 


Condition 

Output  Node  1 

Ouqiut  Node  2 

Normal 

0 

0 

Damaged 

0 

1 

1  Non-lubricated 

1 

1 

The  normal  bearing  condition  consisted  of  a  bearing  that  was  properly  lubricated 
and  free  from  any  defects.  The  damage  induced  on  the  bearing  in  the  next  case  was 
achieved  by  using  a  screwdriver  to  deform  die  housing  near  one  of  the  individual  ball 
bearings.  Figure  7  shows  a  diagram  of  the  bearing  used  and  the  location  of  its  damage. 
The  final  condition  was  a  bearing  stripped  of  its  lubricant  by  soaking  it  in  a  slighdy 
acidic  solution  overnight 

The  neural  network  used  to  examine  this  particular  problem  consisted  of  a  128- 
node  input  layer,  a  2-node  hidden  layer,  and  a  2-node  output  layer.  The  frle  used  to  train 
the  network  was  created  by  appending  three  separate  files,  each  of  which  contained  a  five 


second  segment  (30 
FFTs).  The  network  was 
successfully  trained. 

The  neural  network 
responded  as  well  to  the 
test  data  as  it  did  to  the 
data  on  which  it  was 
trained.  This  indicates 
that  the  neural  network 
had  the  ability  to  discern 
these  faults  and  to  generalize  sufficiently  to  disregard  noise.  Table  m  contains  the 
average  values  of  the  network’s  output  for  the  test  set  of  data 
Examples  of  each  of  the  128  point 
FFT  spectrums  that  were  used  to  train  the 
network  is  included  in  Figures  9  through 
11.  As  can  be  seen,  the  visual  detection 
of  differences  within  the  various 
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Figure  10  Frequency  Spectrum  of 
non-lubricated  bearing 


BEARING 


Figure  8  Bearing  with  damage  location 
indicated 


frequency  spectrum  is  often  difficult 
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Disturbances 

When  taking  measurements  using 

an  accelerometer  as  the  sensor,  the  effect 

of  noise  on  the  system  must  be  examined. 

For  practical  purposes,  noise  is  defined  to 
% 

include  sensor  noise  and  any  signal  that 
does  not  originate  from  within  the  system. 
Previous  studies  have  indicated  diat  neural 
networits  are  robust  and  have  the  ability  to 
filter  out  noise  in  the  same  manner  that  a 
differential  amplifier  reduces 
noise.  [Ref  #10] 
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Figure  9  Frequency  Spectrum  of 
Normal  Bearing 
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Table  IV:  Response  of  the  output  layer  node’s  to  the  test  data  for  various 


fault  conditions.  Notice  the  similarity  to  Table  m. 


Condition 

Output  Node  1 

Output  Node  2 

Normal  Condition 

.170 

.024 

Damaged 

.968 

.013 

Non-Lubricated 

.990 

.981 

A  differential  ampliffer  makes  use  of  the  fact  that  the  noise  on  each  of  its  two 


input  leads  will  be  similar  if  the  leads  are  in  close  proximity  to  one  another  (ie.  common¬ 
mode  noise).  The  amplifier  computes  the  difference  of  the  signals  on  the  two  leads  which 
cancels  out  the  noise.  In  a  neural  network,  the  noise  is  distributed  evenly  across  each 
channel.  Given  that  there  are  a  substantial  number  of  input  channels,  the  neural  network 
is  able  to  filter  out  the  noise. 

The  network’s  response  to  disturbances  is  important  If  the  network  output  is 
unduly  influenced  by  disturbance  noise,  then  the  conclusions  that  the  network  reaches  are 
not  accurate.  In  any  sort  of  condition  monitoring  system,  a  high  level  of 
misclassifications  carmot  be  tolerated.  Urns,  it  is  important  to  ensure  that  disturbances 
do  not  influence  the  network’s  output 

The  first  approach  to  this  problem  was  to  determine  if  disturbances  had  a 
detrimental  effect  on  the  network’s  ability  to  make  classifications.  This  was  accomplished 
by  training  the  network  to  recognize  certain  fault  conditions  and  then  subjecting  the 
system  to  disturbances.  The  trained  network  was  then  tested  on  these  new  data.  The 
result  was  that  the  output  of  the  neural  network  varied  with  the  disturbances,  which 
indicated  that  the  disturbances  had  a  negative  influence  the  network’s  ability  to  make  a 
classification. 

The  next  step  involved  determining  if  the  network  could  be  trained  to  classify 
data  accurately  in  the  presence  of  disturbance  noise.  This  test  was  implemented  by 
treating  each  disturbance  as  a  normal  condition  and  then  training  the  neural  network  on 
it  as  well  as  normal  conditions.  This  was  successful.  The  neural  network  was  able  to 
train  on  these  data.  However,  upon  further  study  it  became  apparent  that  this  experiment 
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was  too  naiTowIy  confined. 

The  data  on  which  the  netwoiic  was  tested  consisted  of  vibration  data  collected  at 
the  same  time  that  the  test  data  were  gathered.  This  meant  that  the  netwoiies  ability  to 
detect  this  specific  disturbance  did  not  correlate  to  how  it  would  detect  disturbances  of 
a  different  nature.  This  conclusion  was  confirmed  upon  further  testing  of  the  networic. 
In  this  test,  data  on  disturbances  of  different  types  were  collected.  The  network  could  be 
trained  on  specific  disturbances,  but  did  not  have  the  ability  to  make  generalizations  on 
other  disturbances.  The  problem  with  this  system  is  that  there  is  no  guarantee  that  the 
network  could  detect  disturbances  upon  which  it  was  not  trained. 

A  neural  network  based  condition  monitoring  system  roust  be  able  to  handle  all 
disturbance  conditions  that  it  might  face.  These  disturbances  may  vary  in  magnitude  or 
frequency,  but  the  network  is  still  required  to  identify  them.  The  solution  that  was  used 
in  this  problem  was  simUar  to  a  method  used  by  Chow  and  Yee.  Their  procedure 
involved  the  use  of  a  series  of  time  delay  mechanisms  which  fed  the  same  information 
to  the  neural  network  at  different  times.  [Ref  #1 1] 

The  technique  used  in  this  experiment  was  a  time  averaging  of  the  output  values 
of  the  neural  network.  By  averaging  the  values  of  the  network’s  output  over  50 
iterations,  the  general  trend  of  the  data  quickly  became  apparent  This  can  be  verifted 
visuaUy  by  looking  at  the  output  of  the  neural  network  (see  Figure  12).  The  desired 
output  for  this  data  was  0.9.  The  basis  for  this  technique  is  the  fact  that  no  classification 
of  fault/no  fault  for  a  piece  of  machinery  should  be  based  on  a  single  128-point  FFT. 

Care  must  be  taken  to  ensure  that  the  standard  deviation  of  network’s  ou^ut  is  not 


exceptionally  large.  The 
standard  deviation  was  not 
calculated  in  this 
experiment,  but  the  outputs 
were  perused  visually  to 
determine  if  a  general  trend 
did  exist  within  the  neural 
network’s  output  Figure 
13  contains  a  sample  of 
data  where  the  standard 
deviation  is  large.  No 
conclusions  can  be  drawn 
on  these  data  because  of 
this  erratic  nature. 
However,  a  trend  did  exist 
in  most  of  the  data  which 
was  tested.  The  network 
output  values  tended  to 
form  around  a  certain  of 
output  as  was  expected 
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Network  Output  to  Disturbance  . 
Shows  trerKi  exist  within  the  data 


- ActiMl  VakM  —  Atftrag*  Vahj* 


Figure  12  The  network's  output  tends  to 
vary  around  a  mean  of  .861  with  a  small 
standard  deviation. 


Network  Output  to  Disturbance; 
Shows  large  deviation  from  mean 


—  AOiMi  VahJ*  —  Avtrag*  VahJ* 


Figure  13  The  network's  output  in  this 
graph  is  volatile.  The  high  standard 
deviation  makes  it  difficult  to  consider 
these  data  as  valid. 


(Figure  12). 

The  time  averaging  technique  used  in  this  portion  of  the  experiment  acted  as  a 
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filter  which  reduced  the  effects  of  disturbances  within  the  network.  It  is  important  to  note 
that  this  filter  was  not  constructed  to  filter  out  disturbances  which  were  produced  by  the 
system  itself.  The  goal  was  to  eliminate  disturbances  that  were  created  external  to  the 
system  and  served  to  degrade  the  actual  system  data.  The  information  that  is  being 
examined  with  respect  to  the  system’s  vibration  characteristics  is  periodic  and  should  still 
be  reflected  in  the  frequency  spectrum  that  is  presented  to  the  neural  network. 

Problems  Encountered 

One  of  the  largest  problems  faced  during  the  course  of  this  research  was  the 
changing  vibrational  characteristics  of  the  physical  system  itself.  There  were  a  lot  of 
parts  to  the  physical  setup  that  could  be  altered  and  adjusted.  Each  time  the  machine  was 
adjusted,  there  existed  the  possibility  that  this  would  in  some  way  change  the  frequency 
spectrum  of  the  machine.  This  changing  spectrum  was  a  cause  of  concern  because  of  the 
direct  influence  that  it  played  in  the  networks  ability  to  detect  and  classify  faults. 

In  order  to  eliminate  this  effect,  all  of  die  data  for  each  specific  fault  condition 
were  collected  at  one  time,  without  changing  the  setup.  However,  in  most  cases  when 
the  specific  condition  was  simulated  again  after  a  change  in  the  setup,  the  neural  network 
was  not  able  to  correctly  diagnose  the  condition  of  the  machinery.  This  network 
robustness  problem  needs  to  be  addressed  in  the  future. 

Another  potential  problem  that  was  examined  dealt  with  the  possibility  that  the 
neural  network’s  ouqiut  was  a  function  of  a  few  select  input  nodes.  Such  a  feature  of  the 


neural  network  is  not  desirable  because  it  indicates  that  the  network’s  success  is  too 
dependent  on  these  few  input  nodes.  This  possibility  was  raised  due  to  the  high  level  of 
the  network’s  sensitivity,  as  evidenced  by  its  training  on  disturbance  noise  and  its 
response  to  slight  variations  in  the  frequency  spectrum. 

This  theory  wtu.  tested  in  this  project  by  looking  at  the  network’s  response  after  twenty 
of  the  network’s  nodes  had  been  disabled.  This  experiment  was  repeated  three  times  with 
twenty  different  nodes  each  time  and  the  network  stiU  responded  accurately  to  the  input 
data.  This  technique  was  also  used  in  a  report  written  by  Prof  Clement  and  similar  results 
were  achieved.  [Ref  #12] 
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CONCLUSION 

Based  on  the  results  of  this  project,  several  interesting  conclusions  can  be  drawn. 
The  first  is  that  the  best  way  to  handle  disturbance  noise  that  is  generated  external  to  the 
physical  system  being  studied  is  to  implement  some  sort  of  filtering  technique.  In  this 
project,  the  time  averaging  technique  was  found  to  be  satisfactory. 

This  experiment  demonstrated  that  a  balance  must  achieved  in  terms  of  the  amount 
of  pre-processing  that  needs  to  be  done  to  the  signal  in  order  to  ensure  that  the  neural 
nctwotk  can  perform  the  required  feature  extraction.  There  is  limited  research  in  this 
area,  which  is  why  current  research  into  neural  network  based  condition  monitoring 
systems  relies  heavily  on  pre-processing  the  data.  This  project  attempted  to  show  that 
such  pre-processing  was  unnecessary.  The  results  for  this  portion  of  the  project  were 
mixed.  In  general,  the  level  of  pre-processing  that  is  required  must  be  determined  for 
each  set  of  data. 

Another  conclusion  that  can  be  drawn  from  this  project  is  the  ability  of  the  neural 
network  to  identify  re-occurring  transients.  In  order  to  recognize  these  transients,  the 
neural  network  must  break  die  transient  into  a  series  of  steps  and  train  on  all  of  the  steps. 
Essentially,  the  network  treats  each  transient  step  as  a  normal  operating  condition  and 
classified  the  information  appropriately. 

The  final  conclusion  that  can  be  drawn  is  the  fact  that  the  backpropagation  neural 
network  does  not  have  the  ability  to  make  classifications  on  a  condition  on  which  it  has 
not  been  trained.  If  the  data  were  not  included  in  the  training  set,  then  the  neural 
network’s  output  cannot  be  accurately  predicted.  The  network’s  output  is  said  to  be 
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unconstrained  for  these  sets  of  data. 

The  results  of  this  project  indicate  that  there  is  a  future  for  neural  network  based 
condition  monitoring  systems.  They  are  valuable  tools  that  can  help  the  Navy  with  its 
machineiy  diagnostics  problems.  Future  study  should  focus  on  constructing  an 
operational  neural  networic  that  incoiporates  all  of  the  various  aspects  that  have  been 
addressed  in  this  project 

Further  study  must  also  be  done  on  determining  the  optimal  architecture  for  each 
application  of  neural  networks.  Currently,  the  trial  and  error  method  is  used  until  a 
satisfactory  neural  network  is  found.  This  is  inefficient  A  set  of  guidelines  would  save 
a  lot  of  time  and  effort 
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Amplifier 


APPENDIX  n 

Data  Handling  Program 
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/*  ’vmi.c- teal  dMa  Aon  imaOlcMd.  •/ 
/*  wrioea  by  AMialaal  Ptofaiot  Winiam  Clemeac 


•iachuk  <iiialhh> 


#iidiKle  <sldlft>Ji> 
biadnde  <dafJi> 

•iadude  'pdeiiiJi”  /*  ladode  PCLAB  cnor  oodet  for  Toriio  C  */ 

diadade  "pddrfi.h*  ladode  PCXAB  foacdoa  dedantioea  far  Totbo  C  */ 


typedef  finict  oomiileu  { 

): 


dadUe  Re,Im: 


diadade  "fifUacb* 
diadode  ’’pjib.h* 


ddefiae  POINTS 


256 


/*  a  power  of  2*/ 


ddefiae  TIMING^OORCE  0 
ddefiae  STARTjCHANNEL  0 
ddefiae  ENDjCHANNEL  0 

ddefiae  GAIN  1 


/*  Software  aigfer,  ioterai]  dock  */ 


/*  A/D  envenioo  gaia  */ 


eaten  oBsigaed  .ftidea  *  0i2i000; 


/*  Set  itack  nm  to  81C.  */ 


/*: - •/ 

mainO 

( 


iat 

i,  j,  *ad_data,  errorM),  keyjapot,  optioB,  fileJIagsO; 

float 

dock_Cteqiieacywl 0000.0; 

/*  Block  read  fieq.  (HZ)  */ 

doable 

*tiine_axis,  *fie^aai<,  *data; 

doable 

oet^ootaO; 

char 

faainellO]; 

ITLE 

•tp: 

if  (  (ad_daiaB(iBt  *)  caIlac(POINTS.  fizeofQot))>BBNULL  ) 
eail(l); 

if  (  (uwejuutsfdoMe  *)  calloc  (POINTS,  aizeof(doabie)))sENl}LL  ) 
eait(l): 

if  (  ^eq_aaiss<daable  *)  calloc  (POINTS,  fizecf(doabie))>ssNULL  ) 
eait(l); 

if  (  (datwKdooUe  •)  calloc  (POINTS,  aaeof(doiible))^NULL  ) 
eait(I); 

for  0=0;  i<POINTS-,  H-f)  ( 

timejMusIi]  -  i/dock_fteqtteDcy; 

&eq_azis{il  =  i  *  dock_ft«qiieacy/POINTS; 

) 

piiatfrSave  aa  ontpot  data  file?  (y/a)  "); 

i  =  getefaO: 
if  0=’/  n  i=.’r)  ( 

priBtfrName  of  OUTPUT  FILE:  *); 
acaafr%*’iaaiiie); 
fp  -  fapea(faaiiie,’w*); 
filejlagwl; 

priatfCDeaned  Network  Ootpot'  ’); 
route  %ir,ftaet_atit); 


/•  lailialiae  die  PCLAB  rootiaei  aad  kUrm  boadtl:  */ 

ieitialixeO: 

teiec«_ba«d(l); 

ence  *  aeap_wic(TIMING_SOURCE.  START.CHANNEL,  END.CHANNEL,  GAIN): 
cnor  B  teiw<lacl(_freqiieacy(AckicmKqDe«ey): 


tyMemCcb'); 

pdi^Yaar  ofiliaat  «e:\B*): 

pHalfr  (t»  Qinr.arVa*): 

piiMfr  0)  Plot  nME-DOMAIN  (teu.  of)a’); 

ptiMfC  a)  Plot  FREQUENCY-DOMAIN  diie  WITH  Hauieg  Wiadow.Va’): 

pnMff  0)  Plot  FR£QUPNCY-DC»!AlNdMa  WITHOUT  HaMiagWiMlow.Vii*); 

pnatfCOptioa:  *); 

•cuff  «a*,Aap(ioa); 

)  while  (ofitioaO  II  aplioii>3); 

while  (ofNioolsO)  ( 

/*  Peifonii  a  eeriei  A/D  ooaveteio*;  */ 

dinUe_eyiiein_dackO; 

emr  z  adc_eedei  (POINTS,  a4_dau); 

eBatile_eyfieiQ_clockO: 

for  6=0;  kPOINTS;  h-«-) 

dalali]  =  (ad_dala|i)-204SV204  8; 

ewiich  (option)  (  , 

caee  1: 

ploCdiu  (tiaie_axii.  data.  POINTS,  2); 
if(filejla8)( 

foe  0=0,  j=0;  i<fOINTS;  i++,  j=('t-*-j)fl65)  { 
fiai«f(fp.''«,5e  '.data(i]); 
if  0^  AA  il=0  &A  i<POINTS-l) 
fpciBtf(fjp,"\iiA  ’); 

J 

fpiiatf(fp,’  %,Slf\a’jiet_aiit); 

» 

btcak; 

caie  2: 

Haaiiiog_xeal  (<bu,  POINTS); 

case  3: 

FFT_inagaitnde  (data,  POINTS); 
plot_data  (fieq_aus,  data,  POINTS/2.  2); 
if  (filejbg)  I 

foe  0*0,  j=0;  i<POINTS/2;  M-f,  j=<++j)W5)  { 
fixutf(fp^’%,5e  ”,data(i]): 
if  (j=0  A&  il=0  AA  kPOINTS/M) 
Iiaiatf(ip,’\aA  '); 

I 

fptadifp'  %JIf\n',aet_oot); 

) 

beeak; 


if  (kfahitO)  ( 

getchO; 
while  (kbhitO) 

feiebO: 

aeOnodefteit'O; 


